Tactile launching of an asymmetric visual communication session

ABSTRACT

Various arrangements for initiating and conducting asymmetrical video communication are presented. A streaming video and audio device may detect an acceleration of the streaming video and audio device over a predefined acceleration threshold. In response, the streaming video and audio device may transmit an acceleration-triggered asymmetrical video communication request to a cloud-based communication server system. The cloud-based communication server system may determine a user account that is linked with the video and audio capture device from which the acceleration-triggered asymmetrical video communication request was received. In response, an acceleration-triggered asymmetrical video communication link may be established between the video and audio capture device and an application executed by a mobile device.

BACKGROUND

Smart devices are becoming more prevalent in homes, offices, and otherlocations. One common smart device is a streaming video camera. Such astreaming video camera may be conspicuously or inconspicuously placedindoors or outdoors. Such a streaming video camera typically transmits avideo and audio stream to a remote device or server system for storage.An authorized user may then use a computerized device to access and viewthe video and listen to the audio.

SUMMARY

Various embodiments are described related to an asymmetrical videocommunication system. In some embodiments, an asymmetrical videocommunication system is described. The system may include an applicationexecuted using one or more processors of a mobile device. The mobiledevice may include a display, a microphone, and a speaker. The systemmay include a video and audio capture device including a housing, one ormore processors housed by the housing, a video camera housed by thehousing, a microphone housed by the housing, a speaker housed by thehousing, and an accelerometer housed by the housing. The video and audiocapture device may not have an electronic display. In response todetecting an acceleration over a predefined acceleration threshold, anacceleration-triggered asymmetrical video communication request may becaused to be transmitted to a cloud-based communication server system.The cloud-based communication server system may be configured to receivethe acceleration-triggered asymmetrical video communication request fromthe video and audio capture device. The system may determine a useraccount from a plurality of user accounts that is linked with the videoand audio capture device from which the acceleration-triggeredasymmetrical video communication request was received. The system mayestablish an acceleration-triggered asymmetrical video communicationlink between the application executed by the mobile device and the videoand audio capture device. The video and audio captured using the videoand audio capture device may be streamed via the cloud-basedcommunication server system to the application and audio captured by themobile device may be streamed via the cloud-based communication serversystem to the video and audio capture device. The video may not bestreamed from the mobile device to the cloud-based communication serversystem or to the video and audio capture device.

Embodiments of a such a system may include one or more of the followingfeatures: The cloud-based communication server system may be configuredto transmit a request to the application executed by the mobile devicebased on the application being linked with the user account in responseto receiving the acceleration-triggered asymmetrical video communicationrequest. The mobile device may be configured to present a notificationindicative of the request in response to receiving the request for theacceleration-triggered asymmetrical video communication link. Inresponse to user input provided in response to the presentednotification, the application executed by the mobile device may beconfigured to cause an activation request to be transmitted to thecloud-based communication server system. The system may be configured toestablish the acceleration-triggered asymmetrical video communicationlink between the application executed by the mobile device and the videoand audio capture device based on receipt of the activation request. Thesystem may be configured to transmit a picture captured by the video andaudio capture device as part of the notification. The picture may becaptured based on the video and audio capture device detecting theacceleration over the predefined acceleration threshold. The system maybe configured to transmit a video clip captured by the video and audiocapture device as part of the notification. The video clip may becaptured based on the video and audio capture device detecting theacceleration over the predefined acceleration threshold. The video clipmay temporally span from before the video and audio capture devicedetected the acceleration over the predefined acceleration thresholduntil after the video and audio capture device detected the accelerationover the predefined acceleration threshold. The acceleration over thepredefined acceleration threshold may be indicative of a user performinga tap on the housing of the video and audio capture device. Theacceleration-triggered asymmetrical video communication request to betransmitted to a cloud-based communication server system may betransmitted by the video and audio capture device in response todetecting multiple distinct instances of acceleration greater than thepredefined acceleration threshold with a predefined period of time. Thevideo and audio capture device may be configured to transmit a patternindication to the cloud-based communication server system. The patternindication may indicate a number of distinct instances of accelerationgreater than the predefined acceleration threshold within the predefinedperiod of time. The system may be configured to determine the useraccount from the plurality of user accounts at least partially based onthe pattern indication. The video and audio capture device may include alight and the video and audio capture device may be configured to changean illumination state of the light in response to theacceleration-triggered asymmetrical video communication request beingtransmitted.

In some embodiments, a method for initiating and conducting asymmetricalvideo communication is described. The method may include detecting, by astreaming video and audio device, an acceleration of the streaming videoand audio device over a predefined acceleration threshold. The methodmay include transmitting, by a streaming video and audio device, anacceleration-triggered asymmetrical video communication request to acloud-based communication server system in response to detecting theacceleration of the streaming video and audio device over the predefinedacceleration threshold. The video and audio capture device may not havean electronic display. The method may include receiving, by thecloud-based communication server system, the acceleration-triggeredasymmetrical video communication request from the video and audiocapture device. The method may include determining, by the cloud-basedcommunication server system, a user account that is linked with thevideo and audio capture device from which the acceleration-triggeredasymmetrical video communication request was received. The method mayinclude establishing, by the cloud-based communication server system, anacceleration-triggered asymmetrical video communication link between thevideo and audio capture device and an application executed by a mobiledevice that may include a display, microphone, and speaker. The methodmay include streaming video and audio captured using the video and audiocapture device via the cloud-based communication server system to theapplication and audio captured by the mobile device may be streamed viathe cloud-based communication server system to the video and audiocapture device.

Embodiments of such a method may include one or more of the followingfeatures: The method may include transmitting, by the cloud-basedcommunication server system, a request to the application executed bythe mobile device based on the application being linked with the useraccount in response to receiving the acceleration-triggered asymmetricalvideo communication request. The method may include presenting, by theapplication of the mobile device, a notification indicative of therequest in response to receiving the request for theacceleration-triggered asymmetrical video communication link. The methodmay include causing, by the application an activation request to betransmitted to the cloud-based communication server system in responseto user input provided in response to the presented notification. Themethod may include establishing, by the cloud-based communication serversystem, the acceleration-triggered asymmetrical video communication linkbetween the application executed by the mobile device and the video andaudio capture device based on receipt of the activation request. Themethod may include transmitting, by the cloud-based communication serversystem, a picture captured by the video and audio capture device as partof the notification. The picture may be captured based on the video andaudio capture device detecting the acceleration over the predefinedacceleration threshold. The method may include transmitting, by thecloud-based communication server system, as part of the notification, avideo clip captured by the video and audio capture device. The videoclip may be captured based on the video and audio capture devicedetecting the acceleration over the predefined acceleration threshold.The video clip may temporally span from before the video and audiocapture device detected the acceleration over the predefinedacceleration threshold until after the video and audio capture devicedetected the acceleration over the predefined acceleration threshold.The acceleration-triggered asymmetrical video communication request maybe transmitted to the cloud-based communication server system by thevideo and audio capture device in response to detecting multipledistinct instances of acceleration greater than the predefinedacceleration threshold within a predefined period of time. The methodmay include transmitting, by the video and audio capture device, apattern indication to the cloud-based communication server system. Thepattern indication may indicate a number of distinct instances ofacceleration greater than the predefined acceleration threshold withinthe predefined period of time. The method may include selecting, by thecloud-based communication server system, the user account from aplurality of user accounts at least partially based on the patternindication. The method may include altering, by the video and audiodevice, an illumination state of a light in response to theacceleration-triggered asymmetrical video communication request beingtransmitted.

BRIEF DESCRIPTION OF THE DRAWINGS

A further understanding of the nature and advantages of variousembodiments may be realized by reference to the following figures. Inthe appended figures, similar components or features may have the samereference label. Further, various components of the same type may bedistinguished by following the reference label by a dash and a secondlabel that distinguishes among the similar components. If only the firstreference label is used in the specification, the description isapplicable to any one of the similar components having the same firstreference label irrespective of the second reference label.

FIG. 1 illustrates an embodiment of a smart home environment.

FIG. 2 illustrates a block diagram of a representative networkarchitecture that includes a smart home network.

FIG. 3 illustrates a block diagram representative of an operatingenvironment in which a cloud-based server system provides dataprocessing for monitoring and facilitating review of events (e.g.,motion, audio, security, etc.) in video and audio streams captured byvideo and audio capture devices.

FIG. 4 illustrates an embodiment of an asymmetric video communicationsystem.

FIG. 5 illustrates an embodiment a cloud-based communication serversystem.

FIG. 6 illustrates an embodiment of a person requesting an asymmetriccommunication link via a video and audio capture device.

FIG. 7 illustrates an embodiment of a method for establishing anasymmetric communication link.

FIG. 8 illustrates an embodiment of a method for enabling asymmetricalvideo communication for a video and audio capture device.

DETAILED DESCRIPTION

A video and audio capture device, such as a streaming video camera, maycapture video and, possibly, audio. This video and audio may be streamedor otherwise transmitted for remote storage. For instance, a cloud-basedhost server may store video and audio captured by the capture device foran indefinite or rolling window of time. Such a capture device mayprimarily be used for capturing video and audio. Therefore, such acapture device might have a speaker and/or a status light, but may nothave a display capable of presenting video. Further, other hardware andassociated features may be absent from such a capture device. Forinstance, no user interface, such as one or more buttons, may bepresent. In other cases, even if there are some buttons or controlinputs provided on the device, those items are often designed to beaccessed solely or primarily by a technician or a sophisticated userduring initial device configuration or troubleshooting, and thus areoften compact and/or hidden and not meant for regular everydayinteraction with general users.

Video and audio captured by such a capture device is viewed by a user,who is usually remote from the capture device (e.g., in a differentroom, on a different floor, in a different building, in a different cityor other geography, etc.) using a computerized communication device thatincludes a display. For instance, a smartphone, tablet computer, laptopcomputer, baby monitor, or desktop computer may be used for viewing suchvideo and audio. Such devices typically have a display, a speaker, and amicrophone. The video and audio may be viewed “live” or in “real time”(i.e., the video and audio is streamed to the computerized communicationdevice over a data network, such as by a remote host server or in somecases directly from device-to-device, as soon as practicable uponreceipt from the capture device). As such, a user can view,approximately in real time, what is occurring within the field-of-viewof the capture device. Since the capture device may have a microphoneand a speaker, the user viewing the audio and video on the computerizedcommunication device can interact (e.g., have a conversation) with aperson present near the capture device via a live communication link.

Such live communication, however, can be asymmetrical in multiplerespects. First, while the user of the computerized communication devicecan view video captured by the capture device, the person present nearthe capture device may not have access to a video feed of the user onthe capture device. Second, when a user of the computerizedcommunication device desires to establish a live communication link withthe capture device, the user could use the computerized device to launchan application or webpage that establishes the live communicationchannel. However, the person near the capture device may typically haveno way (or a very difficult way) of using the capture device to directlyinitiate such a live communication link with the user.

In embodiments detailed herein, the capture device may be equipped witha sensor, such as an accelerometer, that can detect physical input(e.g., tactile input) provided to the capture device, such as one or aseries of physical “taps.” Such a capture device may not have any userinput buttons or a display screen. Such one or more taps may trigger arequest to be transmitted to a cloud-based server system which also maybe used to store captured video and audio. In response to this request,the cloud-based server system may determine: 1) a user account linkedwith the capture device; and 2) a computerized communication device tobe contacted based on the request. The cloud-based server system mayestablish an asymmetric communication channel between the computerizedcommunication device and the capture device (in which only thecomputerized device presents video captured by the capture device) inresponse to the request. Alternatively, in response to the request, anotification may be transmitted to the computerized communication devicethat gives a user of the computerized communication device the option ofestablishing the live communication channel with the capture device.Such an arrangement allows a person in the vicinity of the capturedevice to either initiate or request a live communication channel with auser of the computerized communication device.

Such an arrangement has many uses, but may be particularly useful incertain scenarios, such as when a person nearby the capture device doesnot readily have access to another communication device, such as asmartphone. For instance, a capture device may primarily be used as asecurity camera in or near a home. Such a security camera may be used torecord video of children, residents, nannies, and visitors to the home(e.g., repairmen, workmen), collectively referred to as “occupants.”When an occupant wishes to contact a user who has been linked with thecapture device, the occupant may provide physical input to the capturedevice, such as by tapping the capture device one or more times. Thistapping may initiate or request a live communication channel with theuser. A possible use case is a child desiring to contact a parent.Another possible use case is a workman desiring to contact a homeowner.Another possible use case is an elderly parent desiring to contact anadult child. Any of such persons may provide designated physical inputsuch as one or multiple taps to the capture device that may trigger theestablishment or a request for a live communication link with adesignated user.

Further detail regarding such embodiments is provided in relation to thefigures. FIG. 1 illustrates an embodiment of a smart home environment100 in accordance with some implementations detailed herein. The smarthome environment 100 includes a structure 150 (e.g., a house, officebuilding, garage, or mobile home) with various integrated devices. Itwill be appreciated that devices may also be integrated into a smarthome environment 100 that does not include an entire structure 150, suchas an apartment, condominium, or office space. Further, the smart homeenvironment 100 may control and/or be coupled to devices outside of theactual structure 150. Indeed, several devices in the smart homeenvironment 100 need not be physically within the structure 150. Forexample, a device controlling a pool heater 114 or irrigation system 116may be located outside of the structure 150.

It is to be appreciated that “smart home environments” may refer tosmart environments for homes such as a single-family house, but thescope of the present teachings is not so limited. The present teachingsare also applicable, without limitation, to duplexes, townhomes,multi-unit apartment buildings, hotels, retail stores, office buildings,industrial buildings, and more generally any living space or work space.

It is also to be appreciated that while the terms user, customer,installer, homeowner, occupant, guest, tenant, landlord, repair person,and the like may be used to refer to the person or persons acting in thecontext of some particular situations described herein, these referencesdo not limit the scope of the present teachings with respect to theperson or persons who are performing such actions. Thus, for example,the terms user, customer, purchaser, installer, subscriber, andhomeowner may often refer to the same person in the case of asingle-family residential dwelling, because the head of the household isoften the person who makes the purchasing decision, buys the unit, andinstalls and configures the unit, and is also one of the users of theunit. However, in other scenarios, such as a landlord-tenantenvironment, the customer may be the landlord with respect to purchasingthe unit, the installer may be a local apartment supervisor, a firstuser may be the tenant, and a second user may again be the landlord withrespect to remote control functionality. Importantly, while the identityof the person performing the action may be germane to a particularadvantage provided by one or more of the implementations, such identityshould not be construed in the descriptions that follow as necessarilylimiting the scope of the present teachings to those particularindividuals having those particular identities.

The depicted structure 150 includes a plurality of rooms 152, separatedat least partly from each other via walls 154. The walls 154 may includeinterior walls or exterior walls. Each room may further include a floor156 and a ceiling 158. Devices may be mounted on, integrated with and/orsupported by a wall 154, floor 156 or ceiling 158.

In some implementations, the integrated devices of the smart homeenvironment 100 include intelligent, multi-sensing, network-connecteddevices that integrate seamlessly with each other in a smart homenetwork (e.g., 202 FIG. 2) and/or with a central server or acloud-computing system to provide a variety of useful smart homefunctions. The smart home environment 100 may include one or moreintelligent, multi-sensing, network-connected thermostats 102(hereinafter referred to as “smart thermostats 102”), one or moreintelligent, network-connected, multi-sensing hazard detection units 104(hereinafter referred to as “smart hazard detectors 104”), one or moreintelligent, multi-sensing, network-connected entryway interface devices106 and 120 (hereinafter referred to as “smart doorbells 106” and “smartdoor locks 120”), and one or more intelligent, multi-sensing,network-connected alarm systems 122 (hereinafter referred to as “smartalarm systems 122”).

In some implementations, the one or more smart thermostats 102 detectambient climate characteristics (e.g., temperature and/or humidity) andcontrol an HVAC system 103 accordingly. For example, a respective smartthermostat 102 includes an ambient temperature sensor.

The one or more smart hazard detectors 104 may include thermal radiationsensors directed at respective heat sources (e.g., a stove, oven, otherappliances, a fireplace, etc.). For example, a smart hazard detector 104in a kitchen 153 includes a thermal radiation sensor directed at astove/oven 112. A thermal radiation sensor may determine the temperatureof the respective heat source (or a portion thereof) at which it isdirected and may provide corresponding blackbody radiation data asoutput.

The smart doorbell 106 and/or the smart door lock 120 may detect aperson's approach to or departure from a location (e.g., an outer door),control doorbell/door locking functionality (e.g., receive user inputsfrom a portable electronic device 166-1 to actuate bolt of the smartdoor lock 120), announce a person's approach or departure via audio orvisual means, and/or control settings on a security system (e.g., toactivate or deactivate the security system when occupants go and come).In some implementations, the smart doorbell 106 includes some or all ofthe components and features of the camera 118-1. In someimplementations, the smart doorbell 106 includes a camera 118-1, andtherefore, is also called “doorbell camera 106” in this application.Cameras 118-1 and/or 118-2 may function as the video and audio capturedevice detailed in relation to various embodiments herein. Camera 118may be mounted in a location, such as indoors and to a wall. Camera118-2 may function similarly to camera 118-1, but may be placed on asurface. Various embodiments of cameras 118 may be installed indoors oroutdoors.

The smart alarm system 122 may detect the presence of an individualwithin close proximity (e.g., using built-in IR sensors), sound an alarm(e.g., through a built-in speaker, or by sending commands to one or moreexternal speakers), and send notifications to entities or userswithin/outside of the smart home environment 100. In someimplementations, the smart alarm system 122 also includes one or moreinput devices or sensors (e.g., keypad, biometric scanner, NFCtransceiver, microphone) for verifying the identity of a user, and oneor more output devices (e.g., display, speaker). In someimplementations, the smart alarm system 122 may also be set to an“armed” mode, such that detection of a trigger condition or event causesthe alarm to be sounded unless a disarming action is performed.

In some implementations, the smart home environment 100 includes one ormore intelligent, multi-sensing, network-connected wall switches 108(hereinafter referred to as “smart wall switches 108”), along with oneor more intelligent, multi-sensing, network-connected wall pluginterfaces 110 (hereinafter referred to as “smart wall plugs 110”). Thesmart wall switches 108 may detect ambient lighting conditions, detectroom-occupancy states, and control a power and/or dim state of one ormore lights. In some instances, smart wall switches 108 may also controla power state or speed of a fan, such as a ceiling fan. The smart wallplugs 110 may detect occupancy of a room or enclosure and control supplyof power to one or more wall plugs (e.g., such that power is notsupplied to the plug if nobody is at home).

In some implementations, the smart home environment 100 of FIG. 1includes a plurality of intelligent, multi-sensing, network-connectedappliances 112 (hereinafter referred to as “smart appliances 112”), suchas refrigerators, stoves, ovens, televisions, washers, dryers, lights,stereos, intercom systems, garage-door openers, floor fans, ceilingfans, wall air conditioners, pool heaters, irrigation systems, securitysystems, space heaters, window AC units, motorized duct vents, and soforth. In some implementations, when plugged in, an appliance mayannounce itself to the smart home network, such as by indicating whattype of appliance it is, and it may automatically integrate with thecontrols of the smart home. Such communication by the appliance to thesmart home may be facilitated by either a wired or wirelesscommunication protocol. The smart home may also include a variety ofnon-communicating legacy appliances 140, such as old conventionalwasher/dryers, refrigerators, and the like, which may be controlled bysmart wall plugs 110. The smart home environment 100 may further includea variety of partially communicating legacy appliances 142, such asinfrared (“IR”) controlled wall air conditioners or other IR-controlleddevices, which may be controlled by IR signals provided by the smarthazard detectors 104 or the smart wall switches 108.

In some implementations, the smart home environment 100 includes one ormore network-connected cameras 118 that are configured to provide videomonitoring and security in the smart home environment 100. The cameras118 may be used to determine occupancy of the structure 150 and/orparticular rooms 152 in the structure 150, and thus may act as occupancysensors. For example, video captured by the cameras 118 may be processedto identify the presence of an occupant in the structure 150 (e.g., in aparticular room 152). Specific individuals may be identified based, forexample, on their appearance (e.g., height, face) and/or movement (e.g.,their walk/gait). Cameras 118 may additionally include one or moresensors (e.g., IR sensors, motion detectors), input devices (e.g.,microphone for capturing audio), and output devices (e.g., speaker foroutputting audio). In some implementations, the cameras 118 are eachconfigured to operate in a day mode and in a low-light mode (e.g., anight mode). In some implementations, the cameras 118 each include oneor more IR illuminators for providing illumination while the camera isoperating in the low-light mode. In some implementations, the cameras118 include one or more outdoor cameras. In some implementations, theoutdoor cameras include additional features and/or components such asweatherproofing and/or solar ray compensation.

The smart home environment 100 may additionally or alternatively includeone or more other occupancy sensors (e.g., the smart doorbell 106, smartdoor locks 120, touch screens, IR sensors, microphones, ambient lightsensors, motion detectors, smart nightlights 170, etc.). In someimplementations, the smart home environment 100 includes radio-frequencyidentification (RFID) readers (e.g., in each room 152 or a portionthereof) that determine occupancy based on RFID tags located on orembedded in occupants. For example, RFID readers may be integrated intothe smart hazard detectors 104.

The smart home environment 100 may also include communication withdevices outside of the physical home but within a proximate geographicalrange of the home. For example, the smart home environment 100 mayinclude a pool heater monitor 114 that communicates a current pooltemperature to other devices within the smart home environment 100and/or receives commands for controlling the pool temperature.Similarly, the smart home environment 100 may include an irrigationmonitor 116 that communicates information regarding irrigation systemswithin the smart home environment 100 and/or receives controlinformation for controlling such irrigation systems.

By virtue of network connectivity, one or more of the smart home devicesof FIG. 1 may further allow a user to interact with the device even ifthe user is not proximate to the device. For example, a user maycommunicate with a device using a computer (e.g., a desktop computer,laptop computer, or tablet) or other portable electronic device 166(e.g., a mobile phone, such as a smart phone). A webpage or applicationmay be configured to receive communications from the user and controlthe device based on the communications and/or to present informationabout the device's operation to the user. For example, the user may viewa current set point temperature for a device (e.g., a stove) and adjustit using a computer. The user may be in the structure during this remotecommunication or outside the structure.

As discussed above, users may control smart devices in the smart homeenvironment 100 using a network-connected computer or portableelectronic device 166. In some examples, some or all of the occupants(e.g., individuals who live in the home) may register their device 166with the smart home environment 100. Such registration may be made at acentral server to authenticate the occupant and/or the device as beingassociated with the home and to give permission to the occupant to usethe device to control the smart devices in the home. An occupant may usetheir registered device 166 to remotely control the smart devices of thehome, such as when the occupant is at work or on vacation. The occupantmay also use their registered device to control the smart devices whenthe occupant is actually located inside the home, such as when theoccupant is sitting on a couch inside the home. It should be appreciatedthat instead of or in addition to registering devices 166, the smarthome environment 100 may make inferences about which individuals live inthe home and are therefore occupants and which devices 166 areassociated with those individuals. As such, the smart home environmentmay “learn” who is an occupant and permit the devices 166 associatedwith those individuals to control the smart devices of the home.

In some implementations, in addition to containing processing andsensing capabilities, devices 102, 104, 106, 108, 110, 112, 114, 116,118, 120, and/or 122 (collectively referred to as “the smart devices”)are capable of data communications and information sharing with othersmart devices, a central server or cloud-computing system, and/or otherdevices that are network-connected. Data communications may be carriedout using any of a variety of custom or standard wireless protocols(e.g., IEEE 802.15.4, Wi-Fi, ZigBee, 6LoWPAN, Thread, Z-Wave, BluetoothSmart, ISA100.5A, WirelessHART, MiWi, etc.) and/or any of a variety ofcustom or standard wired protocols (e.g., Ethernet, HomePlug, etc.), orany other suitable communication protocol, including communicationprotocols not yet developed as of the filing date of this document.

In some implementations, the smart devices serve as wireless or wiredrepeaters. In some implementations, a first one of the smart devicescommunicates with a second one of the smart devices via a wirelessrouter. The smart devices may further communicate with each other via aconnection (e.g., network interface 160) to a network, such as theInternet. Through the Internet, the smart devices may communicate with acloud-based server system 164 (also called a central server systemand/or a cloud-computing system herein). Cloud-based server system 164may be associated with a manufacturer, support entity, or serviceprovider associated with the smart device(s). In some implementations, auser is able to contact customer support using a smart device itselfrather than needing to use other communication means, such as atelephone or Internet-connected computer. In some implementations,software updates are automatically sent from cloud-based server system164 to smart devices (e.g., when available, when purchased, or atroutine intervals).

In some implementations, the network interface 160 includes aconventional network device (e.g., a router), and the smart homeenvironment 100 of FIG. 1 includes a hub device 180 that iscommunicatively coupled to the network(s) 162 directly or via thenetwork interface 160. The hub device 180 is further communicativelycoupled to one or more of the above intelligent, multi-sensing,network-connected devices (e.g., smart devices of the smart homeenvironment 100). Each of these smart devices optionally communicateswith the hub device 180 using one or more radio communication networksavailable at least in the smart home environment 100 (e.g., ZigBee,Z-Wave, Insteon, Bluetooth, Wi-Fi and other radio communicationnetworks). In some implementations, the hub device 180 and devicescoupled with/to the hub device can be controlled and/or interacted withvia an application running on a smart phone, household controller,laptop, tablet computer, game console or similar electronic device. Insome implementations, a user of such controller application can view thestatus of the hub device or coupled smart devices, configure the hubdevice to interoperate with smart devices newly introduced to the homenetwork, commission new smart devices, and adjust or view settings ofconnected smart devices, etc. In some implementations the hub deviceextends capabilities of low capability smart device to matchcapabilities of the highly capable smart devices of the same type,integrates functionality of multiple different device types—even acrossdifferent communication protocols, and is configured to streamlineadding of new devices and commissioning of the hub device. In someimplementations, hub device 180 further includes a local storage devicefor storing data related to, or output by, smart devices of smart homeenvironment 100. In some implementations, the data includes one or moreof: video data output by a camera device, metadata output by a smartdevice, settings information for a smart device, usage logs for a smartdevice, and the like.

In some implementations, smart home environment 100 includes a localstorage device 190 for storing data related to, or output by, smartdevices of smart home environment 100. In some implementations, the dataincludes one or more of: video data output by a camera device (e.g.,cameras 118 or doorbell camera 106), metadata output by a smart device,settings information for a smart device, usage logs for a smart device,and the like. In some implementations, local storage device 190 iscommunicatively coupled to one or more smart devices via a smart homenetwork (e.g., smart home network 202, FIG. 2). In some implementations,local storage device 190 is selectively coupled to one or more smartdevices via a wired and/or wireless communication network. In someimplementations, local storage device 190 is used to store video datawhen external network conditions are poor. For example, local storagedevice 190 is used when an encoding bitrate of cameras 118 exceeds theavailable bandwidth of the external network (e.g., network(s) 162). Insome implementations, local storage device 190 temporarily stores videodata from one or more cameras (e.g., cameras 118) prior to transferringthe video data to a server system (e.g., server system 164).

Further included and illustrated in the exemplary smart-home environment100 of FIG. 1 are service robots 168, each configured to carry out, inan autonomous manner, any of a variety of household tasks. For someembodiments, the service robots 168 can be respectively configured toperform floor sweeping, floor washing, etc. in a manner similar to thatof known commercially available devices such as the Roomba™ and Scooba™products sold by iRobot, Inc. of Bedford, Mass. Tasks such as floorsweeping and floor washing can be considered as “away” or “while-away”tasks for purposes of the instant description, as it is generally moredesirable for these tasks to be performed when the occupants are notpresent. For other embodiments, one or more of the service robots 168are configured to perform tasks such as playing music for an occupant,serving as a localized thermostat for an occupant, serving as alocalized air monitor/purifier for an occupant, serving as a localizedbaby monitor, serving as a localized hazard detector for an occupant,and so forth, it being generally more desirable for such tasks to becarried out in the immediate presence of the human occupant. Forpurposes of the instant description, such tasks can be considered as“human-facing” or “human-centric” tasks.

When serving as a localized air monitor/purifier for an occupant, aparticular service robot 168 can be considered to be facilitating whatcan be called a “personal health-area network” for the occupant, withthe objective being to keep the air quality in the occupant's immediatespace at healthy levels. Alternatively or in conjunction therewith,other health-related functions can be provided, such as monitoring thetemperature or heart rate of the occupant (e.g., using finely remotesensors, near-field communication with on-person monitors, etc.). Whenserving as a localized hazard detector for an occupant, a particularservice robot 168 can be considered to be facilitating what can becalled a “personal safety-area network” for the occupant, with theobjective being to ensure there is no excessive carbon monoxide, smoke,fire, etc., in the immediate space of the occupant. Methods analogous tothose described above for personal comfort-area networks in terms ofoccupant identifying and tracking are likewise applicable for personalhealth-area network and personal safety-area network embodiments.

According to some embodiments, the above-referenced facilitation ofpersonal comfort-area networks, personal health-area networks, personalsafety-area networks, and/or other such human-facing functionalities ofthe service robots 168, are further enhanced by logical integration withother smart sensors in the home according to rules-based inferencingtechniques or artificial intelligence techniques for achieving betterperformance of those human-facing functionalities and/or for achievingthose goals in energy-conserving or other resource-conserving ways.Thus, for one embodiment relating to personal health-area networks, theair monitor/purifier service robot 168 can be configured to detectwhether a household pet is moving toward the currently settled locationof the occupant (e.g., using on-board sensors and/or by datacommunications with other smart-home sensors along with rules-basedinferencing/artificial intelligence techniques), and if so, the airpurifying rate is immediately increased in preparation for the arrivalof more airborne pet dander. For another embodiment relating to personalsafety-area networks, the hazard detector service robot 168 can beadvised by other smart-home sensors that the temperature and humiditylevels are rising in the kitchen, which is nearby the occupant's currentdining room location, and responsive to this advisory, the hazarddetector service robot 168 will temporarily raise a hazard detectionthreshold, such as a smoke detection threshold, under an inference thatany small increases in ambient smoke levels will most likely be due tocooking activity and not due to a genuinely hazardous condition.

Cameras 118 any other smart home device that has an integrated cameramay have one or more sensors incorporated that can detect movement ortactile input. For instance, one or more accelerometers may beincorporated as part of cameras 118 and 118-2 to sense when each camerais moved and/or when each camera is tapped or otherwise touched. Cameras118 may be incorporated as part of various smart home devices inaddition to smart doorbell 106. For instance, a camera similar tocameras 118 may be incorporated as part of thermostat 102, appliance112, hub device 180, service robot 168, or hazard detector units 104.Each of these devices may be considered a form of video and audiocapture device. Such devices may not have a display on which to presentvideo. Further, such device may not have a dedicated user input device,such as one or more buttons, switches, touchpad interfaces, keypads, ortouchscreens.

FIG. 2 is a block diagram illustrating a representative networkarchitecture 200 that includes a smart home network 202. In someimplementations, the smart devices 204 in the smart home environment 100(e.g., devices 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, and/or122) combine with the hub device 180 to create a mesh network in smarthome network 202. In some implementations, one or more smart devices 204in the smart home network 202 operate as a smart home controller.Additionally and/or alternatively, hub device 180 operates as the smarthome controller. In some implementations, a smart home controller hasmore computing power than other smart devices. In some implementations,a smart home controller processes inputs (e.g., from smart devices 204,electronic device 166, and/or server system 164) and sends commands(e.g., to smart devices 204 in the smart home network 202) to controloperation of the smart home environment 100. In some implementations,some of the smart devices 204 in the smart home network 202 (e.g., inthe mesh network) are “spokesman” nodes (e.g., 204-1) and others are“low-powered” nodes (e.g., 204-9). Some of the smart devices in thesmart home environment 100 are battery powered, while others have aregular and reliable power source, such as by connecting to wiring(e.g., to 120V or 240V line voltage wires) behind the walls 154 of thesmart home environment. The smart devices that have a regular andreliable power source are referred to as “spokesman” nodes. These nodesare typically equipped with the capability of using a wireless protocolto facilitate bidirectional communication with a variety of otherdevices in the smart home environment 100, as well as with the serversystem 164. In some implementations, one or more “spokesman” nodesoperate as a smart home controller. On the other hand, the devices thatare battery powered are the “low-power” nodes. These nodes tend to besmaller than spokesman nodes and typically only communicate usingwireless protocols that require very little power, such as Zigbee®,ZWave®, 6LoWPAN, Thread®, Weave®, Bluetooth®, etc.

In some implementations, some low-power nodes are incapable ofbidirectional communication. These low-power nodes send messages, butthey are unable to “listen.” Thus, other devices in the smart homeenvironment 100, such as the spokesman nodes, cannot send information tothese low-power nodes. In some implementations, some low-power nodes arecapable of only a limited bidirectional communication. For example,other devices are able to communicate with the low-power nodes onlyduring a certain time period.

As described, in some implementations, the smart devices serve aslow-power and spokesman nodes to create a mesh network in the smart homeenvironment 100. In some implementations, individual low-power nodes inthe smart home environment regularly send out messages regarding whatthey are sensing, and the other low-powered nodes in the smart homeenvironment—in addition to sending out their own messages—forward themessages, thereby causing the messages to travel from node to node(i.e., device to device) throughout the smart home network 202. In someimplementations, the spokesman nodes in the smart home network 202,which are able to communicate using a relatively high-powercommunication protocol, such as IEEE 802.11, are able to switch to arelatively low-power communication protocol, such as IEEE 802.15.4, toreceive these messages, translate the messages to other communicationprotocols, and send the translated messages to other spokesman nodesand/or the server system 164 (using, e.g., the relatively high-powercommunication protocol). Thus, the low-powered nodes using low-powercommunication protocols are able to send and/or receive messages acrossthe entire smart home network 202, as well as over the Internet to theserver system 164. In some implementations, the mesh network enables theserver system 164 to regularly receive data from most or all of thesmart devices in the home, make inferences based on the data, facilitatestate synchronization across devices within and outside of the smarthome network 202, and send commands to one or more of the smart devicesto perform tasks in the smart home environment.

As described, the spokesman nodes and some of the low-powered nodes arecapable of “listening.” Accordingly, users, other devices, and/or theserver system 164 may communicate control commands to the low-powerednodes. For example, a user may use the electronic device 166 (e.g., asmart phone) to send commands over the Internet to the server system164, which then relays the commands to one or more spokesman nodes inthe smart home network 202. The spokesman nodes may use a low-powerprotocol to communicate the commands to the low-power nodes throughoutthe smart home network 202, as well as to other spokesman nodes that didnot receive the commands directly from the server system 164.

In some implementations, a smart nightlight 170 of FIG. 1, which is anexample of a smart device 204, is a low-power node. In addition tohousing a light source, the smart nightlight 170 houses an occupancysensor, such as an ultrasonic or passive IR sensor, and an ambient lightsensor, such as a photo resistor or a single-pixel sensor that measureslight in the room. In some implementations, the smart nightlight 170 isconfigured to activate the light source when its ambient light sensordetects that the room is dark and when its occupancy sensor detects thatsomeone is in the room. In other implementations, the smart nightlight170 is simply configured to activate the light source when its ambientlight sensor detects that the room is dark. Further, in someimplementations, the smart nightlight 170 includes a low-power wirelesscommunication chip (e.g., a ZigBee chip) that regularly sends outmessages regarding the occupancy of the room and the amount of light inthe room, including instantaneous messages coincident with the occupancysensor detecting the presence of a person in the room. As mentionedabove, these messages may be sent wirelessly (e.g., using the meshnetwork) from node to node (i.e., smart device to smart device) withinthe smart home network 202 as well as over the Internet to the serversystem 164.

Other examples of low-power nodes include battery-operated versions ofthe smart hazard detectors 104. These smart hazard detectors 104 areoften located in an area without access to constant and reliable powerand may include any number and type of sensors, such as smoke/fire/heatsensors (e.g., thermal radiation sensors), carbon monoxide/dioxidesensors, occupancy/motion sensors, ambient light sensors, ambienttemperature sensors, humidity sensors, and the like. Furthermore, smarthazard detectors 104 may send messages that correspond to each of therespective sensors to the other devices and/or the server system 164,such as by using the mesh network as described above.

Examples of spokesman nodes include smart doorbells 106, smartthermostats 102, smart wall switches 108, and smart wall plugs 110.These devices are often located near and connected to a reliable powersource, and therefore may include more power-consuming components, suchas one or more communication chips capable of bidirectionalcommunication in a variety of protocols.

As explained above with reference to FIG. 1, in some implementations,the smart home environment 100 of FIG. 1 includes a hub device 180 thatis communicatively coupled to the network(s) 162 directly or via thenetwork interface 160. The hub device 180 is further communicativelycoupled to one or more of the smart devices using a radio communicationnetwork that is available at least in the smart home environment 100.Communication protocols used by the radio communication network include,but are not limited to, ZigBee®, Z-Wave®, Insteon®, EuOcean®, Thread®,OSIAN®, Bluetooth Low Energy® and the like. In some implementations, thehub device 180 not only converts the data received from each smartdevice to meet the data format requirements of the network interface 160or the network(s) 162, but also converts information received from thenetwork interface 160 or the network(s) 162 to meet the data formatrequirements of the respective communication protocol associated with atargeted smart device. In some implementations, in addition to dataformat conversion, the hub device 180 further processes the datareceived from the smart devices or information received from the networkinterface 160 or the network(s) 162 preliminary. For example, the hubdevice 180 can integrate inputs from multiple sensors/connected devices(including sensors/devices of the same and/or different types), performhigher level processing on those inputs—e.g., to assess the overallenvironment and coordinate operation among the differentsensors/devices—and/or provide instructions to the different devicesbased on the collection of inputs and programmed processing. It is alsonoted that in some implementations, the network interface 160 and thehub device 180 are integrated to one network device. Functionalitydescribed herein is representative of particular implementations ofsmart devices, control application(s) running on representativeelectronic device(s) (such as a smart phone), hub device(s) 180, andserver(s) coupled to hub device(s) via the Internet or other Wide AreaNetwork. All or a portion of this functionality and associatedoperations can be performed by any elements of the described system—forexample, all or a portion of the functionality described herein as beingperformed by an implementation of the hub device can be performed, indifferent system implementations, in whole or in part on the server, oneor more connected smart devices and/or the control application, ordifferent combinations thereof.

FIG. 3 illustrates a representative operating environment in which aserver system 164 provides data processing for monitoring andfacilitating review of events (e.g., motion, audio, security, etc.) invideo streams captured by video cameras 118 and/or doorbell cameras 106.As shown in FIG. 3, the server system 164 receives video data from videosources 222 (including cameras 118, doorbell cameras 106, and/or othersmart home devices having an integrated video and audio capture system)located at various physical locations (e.g., inside homes, restaurants,stores, streets, parking lots, and/or the smart home environments 100 ofFIG. 1). Each video source 222 (which can include video andaudio-capture devices in addition to cameras 118) may be bound to one ormore reviewer accounts, and the server system 164 provides videomonitoring data for the video source 222 to client devices 220associated with the reviewer accounts. For example, the portableelectronic device 166 is an example of the client device 220. In someimplementations, the server system 164 is a video processing server thatprovides video processing services to video sources and client devices220.

In some implementations, each of the video sources 222 includes one ormore video cameras 118, doorbell cameras 106, or other video and audiocapture devices that capture video and send the captured video tocloud-based server system 164 substantially in real-time. In someimplementations, each of the video sources 222 can include a controllerdevice (not shown) that serves as an intermediary between the one ormore cameras and the cloud-based server system 164. The controllerdevice receives the video data from the one or more cameras, optionallyperforms some preliminary processing on the video data, and sends thevideo data to the server system 164 on behalf of the one or more camerassubstantially in real-time. In some implementations, each camera has itsown on-board processing capabilities to perform some preliminaryprocessing on the captured video data before sending the processed videodata (along with metadata obtained through the preliminary processing)to the controller device and/or the cloud-based server system 164.

In accordance with some implementations, each of the client devices 220(which can be a form of communication device) includes a client-sidemodule. The client-side module communicates with a server-side moduleexecuted on the server system 164 through the one or more networks 162.The client-side module provides client-side functionality for the eventmonitoring and review processing and communications with the server-sidemodule. The server-side module provides server-side functionality forevent monitoring and review processing for any number of client-sidemodules, each residing on a respective client device 220. Theserver-side module also provides server-side functionality for videoprocessing and camera control for any number of the video sources 222,including any number of control devices and the cameras.

In some implementations, the server system 164 includes one or moreprocessors 212, a video storage database 210, an account database 214,an I/O interface to one or more client devices 216, and an I/O interfaceto one or more video sources 218. The I/O interface to one or moreclients 216 facilitates the client-facing input and output processing.The account database 214 stores a plurality of profiles for accountsregistered with the video processing server, where a respective userprofile includes account credentials for a respective reviewer account,and one or more video sources linked to the respective account. The I/Ointerface to one or more video sources 218 facilitates communicationswith one or more video sources 222 (e.g., groups of one or more camerasand associated controller devices). The video storage database 210stores raw video data (and possibly audio) received from the videosources 222, as well as various types of metadata, such as motionevents, event categories, event category models, event filters, andevent masks, for use in data processing for event monitoring and reviewfor each reviewer account.

Examples of a representative client device 220 include communicationdevices such as a handheld computer, a wearable computing device, apersonal digital assistant (PDA), a tablet computer, a laptop computer,a desktop computer, a cellular telephone, a smart phone, an enhancedgeneral packet radio service (EGPRS) mobile phone, a media player, anavigation device, a game console, a television, a remote control, apoint-of-sale (POS) terminal, a vehicle-mounted computer, an ebookreader, or a combination of any two or more of these data processingdevices or other data processing devices.

Examples of the one or more networks 162 include local area networks(LAN) and wide area networks (WAN) such as the Internet. The one or morenetworks 162 are implemented using any known network protocol, includingvarious wired or wireless protocols, such as Ethernet, Universal SerialBus (USB), FIREWIRE, Long Term Evolution (LTE), Global System for MobileCommunications (GSM), Enhanced Data GSM Environment (EDGE), codedivision multiple access (CDMA), time division multiple access (TDMA),Bluetooth, Wi-Fi, voice over Internet Protocol (VoIP), Wi-MAX, or anyother suitable communication protocol.

In some implementations, the server system 164 is implemented on one ormore standalone data processing apparatuses or a distributed network ofcomputers. In some implementations, the server system 164 also employsvarious virtual devices and/or services of third party service providers(e.g., third-party cloud service providers) to provide the underlyingcomputing resources and/or infrastructure resources of the server system164. In some implementations, the server system 164 includes, but is notlimited to, a server computer, a handheld computer, a tablet computer, alaptop computer, a desktop computer, or a combination of any two or moreof these data processing devices or other data processing devices.

The server-client environment shown in FIG. 3 includes both aclient-side portion (e.g., the client-side module) and a server-sideportion (e.g., the server-side module). The division of functionalitybetween the client and server portions of operating environment can varyin different implementations. Similarly, the division of functionalitybetween a video source 222 and the server system 164 can vary indifferent implementations. For example, in some implementations, theclient-side module is a thin-client that provides only user-facing inputand output processing functions, and delegates all other data processingfunctionality to a backend server (e.g., the server system 164).Similarly, in some implementations, a respective one of the videosources 222 is a simple video capturing device that continuouslycaptures and streams video data to the server system 164 with limited orno local preliminary processing on the video data. Although many aspectsof the present technology are described from the perspective of theserver system 164, the corresponding actions performed by a clientdevice 220 and/or the video sources 222 would be apparent to one ofskill in the art. Similarly, some aspects of the present technology maybe described from the perspective of a client device or a video source,and the corresponding actions performed by the video server would beapparent to one of skill in the art. Furthermore, some aspects of thepresent technology may be performed by the server system 164, a clientdevice 220, and a video source 222 cooperatively.

In some implementations, a video source 222 (e.g., cameras 118, doorbell106, or some other video and audio capture device) transmits one or morestreams of video data to the server system 164. In some implementations,the one or more streams may include multiple streams of respectiveresolutions and/or frame rates, of the raw video captured by the cameras118 or smart doorbell 106. In some implementations, the multiple streamsmay include a “primary” stream with a certain resolution and frame rate,corresponding to the raw video captured by the cameras 118 or smartdoorbell 106, and one or more additional streams. An additional streammay be the same video stream as the “primary” stream but at a differentresolution and/or frame rate, or a stream that captures a portion of the“primary” stream (e.g., cropped to include a portion of the field ofview or pixels of the primary stream) at the same or differentresolution and/or frame rate as the “primary” stream.

In some implementations, one or more of the streams are sent from thevideo source 222 directly to a client device 220 (e.g., without beingrouted to, or processed by, the server system 164). In someimplementations, one or more of the streams are stored at the cameras118 or smart doorbell 106 and/or a local storage device (e.g., adedicated recording device), such as a digital video recorder (DVR). Forexample, in accordance with some implementations, the cameras 118 orsmart doorbell 106 stores the most recent 24 hours of video footagerecorded by the camera. In some implementations, portions of the one ormore streams are stored at the cameras 118 or smart doorbell 106 and/orthe local storage device (e.g., portions corresponding to particularevents or times of interest).

In some implementations, the server system 164 transmits one or morestreams of video data to a client device 220 to facilitate eventmonitoring by a user. In some implementations, the one or more streamsmay include multiple streams, of respective resolutions and/or framerates, of the same video feed. In some implementations, the multiplestreams include a “primary” stream with a certain resolution and framerate, corresponding to the video feed, and one or more additionalstreams. An additional stream may be the same video stream as the“primary” stream but at a different resolution and/or frame rate, or astream that shows a portion of the “primary” stream (e.g., cropped toinclude a portion of the field of view or pixels of the primary stream)at the same or different resolution and/or frame rate as the “primary”stream, as described in greater detail in U.S. patent application Ser.No. 15/594,518, which is hereby incorporated by reference.

FIG. 4 illustrates an embodiment of an asymmetric video communicationsystem 400. Asymmetric video communication system 400 may include:cloud-based communication server system 410, network 415, video andaudio capture device 420, and communication device 430. Video and audiocapture device 420 can represent an embodiment of: cameras 118 or someother smart home device capable of capturing video and audio. Video andaudio capture device 420 may include: video camera 421, microphone 422,tactile sensor 423, speaker 424, status light 425, processing system426, and wireless network interface 427.

Video camera 421 and microphone 422 may serve to capture a stream ofvideo and audio, respectively, of a region near video and audio capturedevice 420. Video camera 421 may have a field of view pointed in a fixedor adjustable direction. Video and audio captured using a video camera421 and microphone 422 may be streamed via network 415 to cloud-basedcommunication server system 410.

Video and audio capture device 420 may include tactile sensor 423. Insome embodiments, tactile sensor 423 is an accelerometer. Such anaccelerometer may detect acceleration of video and audio capture device420. Such an accelerometer may provide acceleration measurements toprocessing system 426. Typically, significant acceleration measurementsmay be detected when video and audio capture device 420 is moved or someperson or object impacts video and audio capture device 420. Referringto a person, someone may tap, knock, nudge, turn, shake, wiggle, orotherwise cause an acceleration event to be detected using theaccelerometer functioning as tactile sensor 423. In other embodiments,some other form of tactile sensor may be used. For example, a gyroscopemay be used to detect an impact or motion of video and audio capturedevice 420.

Speaker 424 may be used to output audio received from cloud-basedcommunication server system 410. Status light 425 may be a single ormulti-colored light (e.g., LED or multiple LEDs), that can output astatus indication of video and audio capture device 420. For example,different colors and/or different flashing patterns can representvarious illumination states that correspond to different statuses ofvideo and audio capture device 420.

Processing system 426 may receive data from: video camera 421,microphone 422, and tactile sensor 423. Processing system 426 may outputdata to: speaker 424, and status light 425. Processing system 426 maystream data collected from video camera 421 and microphone 422 tocloud-based communication server system 410. Processing system 426 mayanalyze data received from tactile sensor 423 to determine if a requestfor an asymmetric communication link should be transmitted tocloud-based communication server system 410. If, for example, a detectedamount of acceleration by tactile sensor 423 exceeds a stored andpredefined acceleration threshold, processing system 426 may transmit arequest for an asymmetric communication link to cloud-basedcommunication server system 410. In some embodiments, multipleexcursions of acceleration above the predefined acceleration thresholdmay be required to be detected by processing system 426 within apredefined period of time in order for a request for an asymmetriccommunication link to be transmitted to cloud-based communication serversystem 410. In other embodiments, a pattern detection process may beperformed by processing system 426 to determine a pattern ofacceleration events and the magnitude of such events. For instance, toinitiate an asymmetrical communication link, a pattern of accelerationmay be required to be detected by tactile sensor 423 as analyzed byprocessing system 426. This pattern of acceleration may be required tomatch a predefined stored definition of an acceleration pattern for anasymmetrical communication link request. In other embodiments, anindication of the pattern is transmitted to the cloud-basedcommunication server system 410 as part of or with the request for theasymmetric communication link.

Processing system 426 may include one or more specialized or generalpurpose processors. Such processors may be configured to execute one ormore sets of instructions that are stored by video and audio capturedevice 420 using one or more non-transitory computer readable mediums.It should be understood that in addition to processors andnon-transitory computer readable mediums, video and audio capture device420 may include various other components typically found in computerizeddevices, such as communication buses, power supplies, one or morewireless interfaces, memories, etc. Processing system 426 maycommunicate with cloud-based communication server system 410 viawireless network interface 427. Audio to be output by speaker 424 may bereceived by processing system 426 via wireless network interface 427.Similarly, an indication to be presented to nearby users may be receivedvia wireless network interface 427 and output by processing system 426via status light 425. For example, an illumination state (which caninclude a blinking or solid illumination pattern and/or a particularlight color) may be indicative of a request having been transmitted, butyet having been accepted for an asymmetric communication link. Anotherillumination state may be used to indicate normal operation (e.g., videoand audio being streamed, no request having been transmitted); anotherillumination state may be used to indicate an asymmetric communicationlink is open; and another illumination state may be used to indicatethat the asymmetric communication link was refused. Wireless networkinterface 427 may use one or more of the previously described networkcommunication protocols, such as IEEE 802.11 or, more specifically,WiFi®.

Housing 428 may house: video camera 421, microphone 422, tactile sensor423, speaker 424, status light 425, processing system 426, and wirelessnetwork interface 427. Typically, when the user provides input to videoand audio capture device 420, such as in the form of a tap or bump, sucha tap or bump may be applied to housing 428 and may be sensed viatactile sensor 423.

Of note, video and audio capture device 420 may not have a display or atleast not have a display that is capable of presenting an image orvideo. Therefore, while video camera 421 may be used to stream data tocloud-based communication server system 410, video may not be receivedby and/or presented by video and audio capture device 420. Audio,however, may be received via wireless network interface 427 and becaused to be output via speaker 424 by processing system 426.

Cloud-based communication server system 410 can represent cloud-basedserver system 164 of FIGS. 1-3. While FIG. 3 represents some of thestorage and streaming components of cloud-based server system 164,cloud-based communication server system 410 represents componentryrelated to asymmetric communication links any associated requests.Further detail regarding cloud-based communication server system 410 isprovided in relation to FIG. 5.

Communication device 430 can represent client devices 220. Notably,while video and audio capture device 420 does not have a display,communication device 430 includes display 433 for presenting video, suchas video captured using video camera 421. Communication device 430 caninclude: video camera 431, microphone 432, display 433, speaker 434,user interface 435, processing system 436, and wireless networkinterface 438. As previously noted in respect to client devices 220,communication device 430 can be a handheld computer, a wearablecomputing device, a personal digital assistant (PDA), a tablet computer,a laptop computer, a desktop computer, a cellular telephone, a smartphone, an enhanced general packet radio service (EGPRS) mobile phone, amedia player, a navigation device, a game console, a television, aremote control, a point-of-sale (POS) terminal, a vehicle-mountedcomputer, an ebook reader, or a combination of any two or more of thesedata processing devices or other data processing devices.

Video camera 431, microphone 432, speaker 434, and wireless networkinterface 438 may function as detailed in relation to the similarcomponents of video and audio capture device 420. While video camera 431is present, if an asymmetric communication link is active between videoand audio capture device 420 and communication device 430, no video fromvideo camera 431 may be streamed to video and audio capture device 420(video and audio capture device may have no way of presenting video).Display 433 and user interface 435 may be combined in the form of atouchscreen that allows a user to touch or perform gestures directly ondisplay 433. User interface 435 may also take other forms, such as akeypad, one or more buttons, a touchpad, or some other arrangement thatallows a user to provide input to communication device 430.

Processing system 436 may include one or more specialized or generalpurpose processors. Such processors may be configured to execute one ormore sets of instructions that are stored by communication device 430using a non-transitory computer readable medium. It should be understoodthat in addition to processors and non-transitory computer readablemediums, communication device 430 may include various other componentstypically found in computerized devices, such as communication buses,power supplies, one or more wireless interfaces, memories, etc.Processing system 436 may communicate with cloud-based communicationserver system 410 via wireless network interface 427. Therefore, when anasymmetric communication link is present between video and audio capturedevice 420 and communication device 430, audio captured using microphone432 may be streamed to video and audio capture device 420; video andaudio captured using video camera 421 and microphone 422 may be streamedto communication device 430 and output via display 433 and speaker 434.

If processing system 436 of communication device 430 is ageneral-purpose processing system, one or more applications, including acommunication application 437, may be executed by processing system 436.Communication application 437 may be a specialized application that isdistributed by an operator of cloud-based communication server system410. Communication application 437 may permit a user to log in to a useraccount hosted by cloud-based communication server system 410 and accessservices available through such user account. For example, access tovideo and audio captured using video camera 421 and microphone 422 maybe available via cloud-based communication server system 410. Further,as detailed within this document, a user may be able to accept or deny arequest for an asymmetric communication like initiated at video andaudio capture device 420.

In some embodiments, another entity, such as a third-party cellularservice provider may be in communication with network 162. Cloud-basedcommunication server system 410 may communicate with the third partiescellular service provider in order to send a push notification tocommunication device 430. For example, if a request for an asymmetriccommunication link is received by cloud-based communication serversystem 410 from video and audio capture device 420, cloud-basedcommunication server system 410 may send a request for a pushnotification to be transmitted by the third party cellular serviceprovider to communication device 430. This push notification may berequired to be viewed and dismissed by a user of communication device430 before some or any function of communication device 430 is next usedby the user. This notification may include information about theasymmetric communication link request, such as (a link or direct to) animage captured via video camera 421 and/or video clip captured by videocamera 421. If a video clip is included, the video clip may includevideo captured by video camera 421 for a period of time before and/orperiod of time after the tactile input that triggered the request wasdetected by processing system 426 (based on data received from tactilesensor 423).

In some embodiments, video and audio capture device 420 may communicatedirectly with communication device 430 or may communication withcommunication device 430 via network(s) 162 without communicating with aseparate cloud-based communication server system 410. For example,communication device 430 may be a baby monitor having a display andvideo and audio capture device 420 may be a streaming video camera thattransmits video, either directly or via a network (e.g., a wirelesslocal area network) to the baby monitor. In embodiments where nocloud-based communication server system 410 is present, some or all ofthe functionality detailed herein attributed to cloud-basedcommunication server system 410 may incorporated as part ofcommunication device 430 and/or video and audio capture device 420.

FIG. 5 illustrates an embodiment of cloud-based communication serversystem 500.

Cloud-based communication server system 500 can represent cloud-basedcommunication server system 410 of FIG. 4 and/or cloud-based serversystem 164 of FIGS. 1-3. Cloud-based communication server system 500 caninclude one or more processors and/or one or more server systems whichmay be distributed geographically. Cloud-based communication serversystem 500 may include one or more specialized or general purposeprocessors. Such processors may be configured to execute one or moresets of instructions that are stored by cloud-based communication serversystem 500 using a non-transitory computer readable medium. It should beunderstood that in addition to processors and non-transitory computerreadable mediums, cloud-based communication server system 500 mayinclude various other components typically found in computerizeddevices, such as communication buses, power supplies, one or morewireless interfaces, memories, etc. cloud-based communication serversystem 500 may include: video and audio storage engine 510,communication link initiator 520, notification generator 530, and livecommunication link channel engine 540. Also present may be variousdatabases previously detailed, including video storage database 210 andaccount database 214.

Video and audio storage engine 510 may include the I/O interface to oneor more video sources 218 that facilitates communications with one ormore video sources 222 (e.g., groups of one or more cameras andassociated controller devices) as detailed in relation to FIG. 3. Videoand audio storage engine 510 may serve to receive video and audio datafrom various video and audio capture devices such as video and audiocapture device 420 of FIG. 4. Video and audio storage engine 510receives streaming video, and possibly streaming audio, from a video andaudio capture device and stores such video and audio to a database, suchas video storage database 210. While video storage database 210 refersspecifically to video, it should be understood that audio may be storedin the same or a separate linked database. It should be understood thatvideo and audio from a capture device may be continuously stored byvideo and audio storage engine 510 regardless whether an asymmetriccommunication link is active or not. Therefore, if an asymmetriccommunication link is active, the video and audio may continue to bestored from the associated capture device; however, any audio routed inthe opposite direction (that is, from the communication device to thevideo and audio capture device) may not be recorded or stored.

Live communication link initiator 520 may serve to receive and analyzerequests for asymmetric communication links received from video andaudio capture devices. Live communication link initiator 520 may receivean asymmetric communication link request, analyze such request, anddetermine a recipient linked with a communication device which is toreceive the request for the asymmetric communication link. Communicationlink initiator 520 may access account database 214 to determine acontact and associated communication device linked with the video andaudio capture device. For example, the video and audio capture deviceand the communication device may be linked with a user account. In someembodiments, live communication link initiator 520 may analyze a patternof tactile information, such as acceleration data, received from thevideo and audio capture device in accordance with the asymmetriccommunication link request.

Based upon the video and audio capture device from which the asymmetriccommunication link request is received, live communication linkinitiator 520 may access a linked entry within account database 214. Thelinked account database entry may identify one or more contacts forrouting of the asymmetric communication link request. Three sampleentries 550 are illustrated from account database 214. It should beunderstood that these three sample entries 550 are merely exemplary andillustrate some of the possible data fields which may be present withinaccount database 214. In other embodiments, fewer or greater numbers ofdata fields may be present. Further, such data fields may be spreadacross multiple databases rather than all being present within accountdatabase 214. For example, for the user account of “Jeff.P.Doe”, twocontacts are present: the smart phone linked with Jeff.P.Doe and a smartphone linked with Jane.P.Doe. By Jeff.P.Doe being the default contact, asingle tactile input that is greater than the tactile threshold mayresult in an asymmetric communication link request received from thevideo and audio capture device linked with this account to be routed tothe smart phone of Jeff.P.Doe. However, if accompanying data indicatesthat the pattern of tactile input is a double tap, the asymmetriccommunication link request may be routed to the smart phone ofJane.P.Doe. Regarding the second example entry, the user account of“JohnJJoseph” does not have asymmetric communication links enabled; assuch, the video and audio capture device linked with this account maynot transmit asymmetric communication link requests or such receivedasymmetric communication link requests may be ignored by cloud-basedcommunication server system 500. For the third sample entry of useraccount “ErinS,” a different secondary contact Pattern is indicated. Ifthe tactile input pattern is indicative of a single tap, followed by ashort pause, followed by a double tap, the asymmetric communication linkrequest may be routed to the web account of “James3493.”

Once live communication link initiator 520 has received an asymmetriccommunication link request, and has determined that the associated useraccount has such a feature enabled, and the appropriate recipientcontact has been determined, such data may be routed to notificationgenerator 530. Notification generator 530 may generate a notificationfor transmitting to the appropriate contact indicated from withinaccount database 214. The notification generated by a notificationgenerator 530 may be an in-application notification, a push notification(e.g., as detailed in relation to FIG. 4), SMS or MMS text message,email, phone call, banner, desktop tray, or pop-up notification.Notification generator 530 may select the appropriate communicationchannel based on a user preference noted in account database 214 or adefault communication channel used by cloud-based communication serversystem 500. The notification generated by notification generator 530 mayindicate the video and audio capture device from which the requestoriginated (e.g., in the form of a location name linked with the capturedevice, such as “bedroom,” “living room,” “kitchen,” or some other roomdesignator). The notification may also include a still video framecaptured by the video and audio capture device at the time the tactileinput was provided to the capture device that triggered the request. Inother embodiments, the still video frame may be captured for apredefined amount of time before or after the tactile input was providedto the capture device that triggered the asymmetric communication linkrequest. Since the video is being constantly streamed to the cloud-basedcommunication server, video frames from prior to the tactile input maybe available. In other embodiments, a short video clip is provided aspart of the notification rather than a still image. This video clip mayinclude audio. The video clip may be captured before, after, or in aperiod of time that spans from before until after the tactile input thattriggered the asymmetric communication link request. As such, the videoclip may temporally span from before the input until after the input.The notification generator 530 may then send the generated notificationto the communication device determined based on the contact informationlinked with the video and audio capture device in account database 214.

Live communication link channel engine 540 may relay audio and videodata (e.g., packets) from the capture device to the communication devicefor live or real-time presentation and may relay audio data (e.g.,packets) from the communication device to the capture device for live orreal-time output. Once this asynchronous communication link has beeninitiated, the communication link may continue to be provided by livecommunication link channel engine 540 until a user of the communicationdevice severs the link by, for example, closing an application beingexecuted by the communication device that enables audio captured by thecommunication device to be transmitted to live communication linkchannel engine 540. In some embodiments, tactile input to the capturedevice provided by a person in the vicinity of the capture device mayindicate that the person desires the communication link to be severed.However, this may only be in the form of a request that is transmittedto and presented on the communication device. Ultimately, the user ofthe communication device may make the decision of if and when theasynchronous communication link is severed. It should be understood thatregardless of the state of the asynchronous communication link, videoand audio storage engine 510 may store continuous video and audio fromthe capture device. Therefore, effectively, if a conversation occurredbetween a person present in the vicinity of the captured device and auser of the communication device, at least the side of the conversation(and including video) from the capture device may be stored in videostorage 10 by video and audio storage engine 510. Audio from thecommunication device may also be stored or may be filtered out (eitherby the captured device or by video and audio storage engine 510).

FIG. 6 illustrates an embodiment 600 of a person requesting anasymmetric communication link via a video and audio capture device. Inembodiment 600, child 601 is tapping with his hand 602 a housing ofvideo and audio capture device 610. In response to this tapping, anasymmetric communication link request may be transmitted to acloud-based server system as detailed in relation to the systems ofFIGS. 1-5 and the method of FIGS. 7 and 8. The tapping pattern performedby child 601 may be dependent on the person with whom child 601 desiresto communicate. An illumination state of status light 611 may changebased on the tapping performed by hand 602 triggering a request beingsent to the cloud-based server system. While the example of embodiment600 involves a child, it should be understood that this is merelyexemplary. Various persons, such as the elderly, workmen, or any personwho finds it more expedient to communicate with a user via a video andaudio capture device, may wish to initiate an asymmetric videocommunication link with a user.

Various methods may be performed with the systems detailed in FIGS. 1-5.FIG. 7 illustrates an embodiment of a method 700 for establishing anasymmetric communication link. Method 700 may be performed using thesystems and devices of FIGS. 1-5. Specifically, referring to FIG. 4,each step of method 700 may be performed using video and audio capturedevice 420, communication device 430, and/or cloud-based communicationserver system 410.

At block 710, a video and audio stream may be captured using a capturedevice, such as video and audio capture device 420. This video and audiodata stream may be transmitted to a cloud-based system continuously ornear continuously for storage at block 720. Therefore, such as for arolling period of time, the cloud-based system may store video and audiocaptured by the capture device allowing a user to retrieve video andaudio for desired time period for review. The cloud-based system forstorage may be a service provided by the same cloud-based server systemas cloud-based communication server system 410. In some arrangements,the capture device may wirelessly stream video and audio data via aWi-Fi network to a router which is connected with an internet serviceprovider (ISP) and the Internet. The video and audio data may betransmitted to the cloud-based system for storage via the Internet. Thisdata may be indicated as being associated with a particular capturedevice and/or a particular user account.

At block 730, while video and audio are being captured and transmittedby the capture device to the cloud-based system for storage, a tactileevent may be detected by the capture device. The tactile event may bedetected as being greater than a defined threshold. In some embodiments,acceleration is used as the measure of the tactile of event. An onboardaccelerometer of the capture device may detect one or more accelerationevents that exceed the defined acceleration threshold stored by thecapture device. In some embodiments, rather than a single accelerationevent being greater than the defined threshold, it may be required thatmultiple acceleration events greater than the defined threshold may needto occur within a defined period of time for a tactile event to bedetermined as having occurred. Additionally or alternatively, a patternof acceleration events may be detected by the capture device. Variancesin timing and magnitude of the acceleration events may be detected bythe capture device. In other embodiments, tactile events are measuredusing a metric other than acceleration. It should be understood thattactile events refer to input provided to the capture device other thana button push or other form of input provided directly to a dedicateduser input component, such as a touch screen, touch pad, switch, orknob. In some embodiments, the accelerometer located onboard the capturedevice may be used for one or more additional uses, such as detecting ifthe capture device has been moved. Such movement may be indicative of anintruder at a location, such as a burglar, moving the capture device inan attempt to avoid detection.

At block 740, in response to detecting the tactile event at block 730, arequest for an asymmetric video communication link may be transmitted tothe cloud-based system at block 740. The cloud-based system to which therequest is transmitted may be the same cloud-based system used forstorage at block 720 or may be a separate cloud-based system that isspecifically for communication links. If the same cloud-based system,the term cloud-based storage server system or cloud-based communicationserver system may be used interchangeably. The request transmitted atblock 740 may indicate the capture device from which the request isoriginating and/or an indication of the user account that is linked withthe capture device. If a pattern of tactile data, such as accelerationdata, has been captured by the capture device, a pattern indication(which is indicative of the pattern detected at the capture device) maybe transmitted as part of or in association with the request for theasymmetric communication link.

At block 750, the cloud-based system may analyze the received request todetermine a communication device to which the request should be routed.Block 750 may involve the cloud-based system performing a lookup ofaccount data to determine a user account that is linked with the capturedevice. The account data may indicate a particular user or device thathas been designated to receive such requests. In some embodiments, therequest may be transmitted to all devices that have been linked with theuser account. In some embodiments, if pattern data has been receivedwith the request for the asymmetric video communication link, thepattern data may be used to determine a particular communication deviceamong multiple communication devices that are linked with the useraccount. Therefore, different predefined patterns may be linked withdifferent communication devices that are linked with the same useraccount. In some embodiments, in response to block 750, method 700 mayproceed to block 770 at which an asymmetric communication link isestablished between the capture device and communication device. Inother embodiments, method 700 may proceed to block 760.

At block 760, a notification may be transmitted to the communicationdevice that prompts a user of the communication device that a requestfor an asymmetric video communication link has been received at thecapture device. The notification may be a push notification (which maybe relayed by a cellular service provider of the communication device),a text message, an email, an automated voice call, and/or an applicationnotification. The notification of block 760 may indicate that anasynchronous communication link request has been received from thecapture device or that movement has been detected at the capture device.The notification may further include video, an image, and/or audio thatwas captured using the capture device a predefined amount of timebefore, at, or a predefined time after the request was detected by thecapture device. In some embodiments, if a video clip is to be includedwith the notification, the video clip may span for a predefined amountof time prior to the tactile event indicative of the request for apredefined period of time after the tactile event. For some embodiments,a circular semiconductor video memory can be used to continuouslycapture the most recent video clip that will become of interest upontactile event detection. The notification may trigger the user to openan application associated with the capture device. This application mayallow a user to view a live or a real-time stream of video and audiofrom the capture device. The user of the communication device may havethe option to activate audio to communicate with a person in thevicinity of the capture device. In some embodiments, in response to thenotification being received by the communication device, thecommunication device may transmit a receipt to the cloud-based serversystem. The cloud-based server system may then transmit a receiptindication to the capture device which may trigger a change in anillumination state of a status light of the capture device. This changein illumination state may indicate to a person in the vicinity of thecapture device that the notification has been successfully sent toand/or received by the communication device. If the user of thecommunication device dismisses the notification, a receipt indicationmay be transmitted to the capture device by the cloud-based serversystem that triggers a change in the illumination state of the statuslight of the capture device to indicate that the asynchronouscommunication link will not be activated. Regardless of whether thenotification results in the asynchronous communication link beingactivated or not, video and audio may continue to be captured by thecapture device and stored by the cloud-based system for storage at block710 and 720.

At block 770, the asymmetric communication link may be establishedbetween the capture device and the communication device. Theasynchronous communication link may be established by a user launchingan application or webpage that requests and presents real time livevideo received by the cloud-based server system from the capture device.The asymmetric communication link being established may include:real-time or live video and audio being presented and output,respectively, by the communication device from the capture device. Thisvideo and audio may be streamed to the communication device via thecloud-based server system. Live audio may also be streamed from thecommunication device to the capture device. In some embodiments, ratherthan audio being continuously streamed from the communication device tothe capture device, the communication device may present an interfacethat allows a user to capture audio snippets using the communicationdevice and transmit such audio snippets to the capture device via thecloud-based server. Therefore, for example, a user may be required tohold down a real or virtual button on the communication device tocapture audio from the communication device which is then transmitted tothe capture device via the cloud-based server. In other embodiments, inresponse to the asymmetric communication link being established,continuous audio captured via a microphone of the communication deviceis streamed to the capture device for output. As previously noted, insome embodiments, rather than a notification being transmitted to thecommunication device, in response to block 750, the asymmetriccommunication link at block 770 may be established. This may involveaudio being captured by the communication device and transmitted to thecapture device for output without input being provided to thecommunication device by the user indicating an immediate desire toactivate such an asynchronous communication link.

FIG. 8 illustrates an embodiment of a method 800 for enablingasymmetrical video communication for a video and audio capture device.Method 800 may be performed using the systems and devices of FIGS. 1-5.Specifically, referring to FIG. 4, each step of method 800 may beperformed using video and audio capture device 420, communication device430, and/or cloud-based communication server system 410. Method 800 maybe performed prior to method 700 being performed for a first time.

At block 810, a user may log in to a user account, possibly from acommunication device at which the user desires to receive asynchronouscommunication link requests. Such a login may occur via an applicationthat is executed by the communication device. Such an application may bedownloaded from an application store (“app store”) or some other remoterepository of applications. At block 820, the user may provide inputthat specifies one or more contacts to be linked with asymmetric videocommunication requests. This may be the communication device and userhimself. In other embodiments, the user may specify some other useraccount, communication device, or user that should be contacted if anasynchronous communication request is received.

If multiple users are to be individually contactable via differenttactile input patterns, block 830 may be performed. At block 830, theapplication may request that the user input a customized tactile inputthat is to be linked with the specified one or more contacts of block820. This request may be in the form of a message presented on a displayscreen of the communication device. Further, this request may indicatethat the user should input the tactile input pattern to the capturedevice or directly to the communication device. If to be provided to thecapture device, method 800 may be performed while the user andcommunication device are in close proximity to the capture device.

At block 840, the customized tactile input pattern may be received. Insome embodiments, this customized tactile input pattern is received viathe capture device or, alternatively, via the communication device. Aspart of block 830, if the customized tactile input pattern is to bereceived via the capture device, the application may alert thecloud-based server system that the customized tactile input pattern isto be received via the capture device and an indication that such inputas to be received may be transmitted to the capture device. This mayplace the capture device in a mode in which such a customized tactileinput pattern can be received. An illumination state of the status lightof the capture device may be set to indicate that the capture device isready to receive the customized tactile input pattern. After thecustomized tactile input pattern has been received at block 840, whethervia the capture device or via the communication device, the user may beprompted to repeat the customized tactile input pattern some number ofadditional times, such as one or two additional times. The customizedtactile input pattern may have variances in the magnitude and timing oftactile input (e.g., taps). Blocks 823, 840 may be repeated such thatdifferent customized tactile input patterns may be set up for differentcommunication devices and/or users or user groups. Therefore, onceasymmetric video communication requests have been activated, thesedifferent customized tactile input patterns may be used to sendnotifications requesting asymmetric communication links with differentusers to different user groups. A common use case of this arrangementmay be if a child in the vicinity of a capture device wishes to contactonly her mother or only her father.

Once at least one contact has been linked with tactile triggeredasymmetric video communication requests, a message may be transmitted tothe cloud-based server system and/or the capture device that authorizesasymmetric video communication requests to be transmitted at block 850.At block 860, asymmetrical video communication is enabled. This caninclude a flag or data field being set at the capture device and/orcloud-based server system that enables asymmetrical video communication.Prior to block 850, tactile input provided to the capture device may nottrigger the capture device to send an asymmetric video communicationlink request to the cloud-based server system or the cloud-basedcommunication server system may not forward a notification or establishsuch a communication link with a communication device in response toreceiving an asymmetric communication link request from the capturedevice. If authorization for asymmetric video communication linkrequests is enforced by the capture device, data may be stored by thecapture device indicating that asymmetric video communication requestscan now be transmitted to the cloud-based server system. Ifauthorization for asymmetric video communication link requests isenforced by the cloud-based server system, the capture device maycontinue to function as prior to block 850. That is, tactile inputinformation above a threshold may be forwarded to the cloud-based serversystem for interpretation and, possibly, for creation of a notificationof a request for an asymmetric video communication link to acommunication device or user linked with the capture device and/or thetactile pattern.

One or more of the described embodiments may be particularlyadvantageous when adapted for use in so-called sleepy capture devices,which are often battery-only devices which have a very limited powerbudget, or, if they are not power constrained, have other limitations ordesired attributes (e.g., to produce very little heat, to produceminimal RF emissions, etc.) such that it is desirable to keep theirhigher-powered radio communications chips (and/or higher-powerprocessors) in a sleep state for most of the time. For some embodiments,there is provided a capture device with an RF chip and imaging systemthat stays off for substantial periods of time, but is equipped with anaccelerometer or other tactile sensor that causes the capture device to“wake up” (e.g., transition to a higher-power mode) when a sufficientlystrong tactile tap or bump event is sensed, and the asymmetriccommunication session is started at that point. This arrangement maysave power because the only component or one of the only components thatneeds to stay awake is the accelerometer/tactile sensor, which can beformed from very low-power components.

According to some embodiments, the capture device is further equippedwith a motion sensor and a circular internal semiconductor video buffer(e.g., capable of keeping 30-300 seconds of video), this configurationcan be very advantageous when configured to operate as follows: thedevice stays in a deep sleep most of the time, with RF radio chips andimaging systems in a sleep state and, optionally, even the tactilesensor in a sleep state; the deep sleep state continues unless and untilmotion is detected, at which point a first, lower-powered wake state isentered in which the network interface (e.g., RF radio chip) remainsasleep but the imaging system fills the circular buffer continuously asmotion is detected and the tactile sensor waits to sense a tactileevent; the lower-powered wake state continues (while motion is stilldetected, otherwise it goes back into deep sleep) until a tactile eventis sensed, at which point a higher-powered wake state is entered inwhich the RF radio chip is also activated and the asymmetriccommunication session is started, the asymmetric communication sessionincluding an RF upload of the buffered video content for review ifdesired. This particular functionality, which can be the sole modalityof the capture device in some embodiments and a user-selectable modalityin other embodiments, can be particularly advantageous for a scenario ofa children's play room as follows: the capture device sleeps deeply mostof the time if no children are present; if children are present, then a30-300 second history is kept locally but energy is not expended in RFuploading; and if a child then walks up and taps the capture device,then the asymmetric communication session is established and isaccompanied by a reviewable 30-300 seconds of recent video, which wouldallow the parent or other viewer to see what also happened in theimmediate 30-300 seconds before the camera was tapped. Thus, forexample, if Child gets a boo-boo and then taps the capture device totalk to Mom, Mom can review the buffered video to see how it happened.In this manner there is an advantageous best-of-both-worlds provided, inthat there is a brief video history available in conjunction with thechild-friendly asymmetric communication session feature, while at thesame time being provided in a generally sleepy device that saves power,which can facilitate the capture device being a battery-only device.

The methods, systems, and devices discussed above are examples. Variousconfigurations may omit, substitute, or add various procedures orcomponents as appropriate. For instance, in alternative configurations,the methods may be performed in an order different from that described,and/or various stages may be added, omitted, and/or combined. Also,features described with respect to certain configurations may becombined in various other configurations. Different aspects and elementsof the configurations may be combined in a similar manner. Also,technology evolves and, thus, many of the elements are examples and donot limit the scope of the disclosure or claims.

Specific details are given in the description to provide a thoroughunderstanding of example configurations (including implementations).However, configurations may be practiced without these specific details.For example, well-known circuits, processes, algorithms, structures, andtechniques have been shown without unnecessary detail in order to avoidobscuring the configurations. This description provides exampleconfigurations only, and does not limit the scope, applicability, orconfigurations of the claims. Rather, the preceding description of theconfigurations will provide those skilled in the art with an enablingdescription for implementing described techniques. Various changes maybe made in the function and arrangement of elements without departingfrom the spirit or scope of the disclosure.

Also, configurations may be described as a process which is depicted asa flow diagram or block diagram. Although each may describe theoperations as a sequential process, many of the operations can beperformed in parallel or concurrently. In addition, the order of theoperations may be rearranged. A process may have additional steps notincluded in the figure. Furthermore, examples of the methods may beimplemented by hardware, software, firmware, middleware, microcode,hardware description languages, or any combination thereof. Whenimplemented in software, firmware, middleware, or microcode, the programcode or code segments to perform the necessary tasks may be stored in anon-transitory computer-readable medium such as a storage medium.Processors may perform the described tasks.

Having described several example configurations, various modifications,alternative constructions, and equivalents may be used without departingfrom the spirit of the disclosure. For example, the above elements maybe components of a larger system, wherein other rules may takeprecedence over or otherwise modify the application of the invention.Also, a number of steps may be undertaken before, during, or after theabove elements are considered.

What is claimed is:
 1. An asymmetrical video communication system,comprising: an application executed using one or more processors of amobile device, wherein the mobile device comprises a mobile devicedisplay; a mobile device microphone; and a mobile device speaker; and avideo and audio capture device, comprising: a housing; one or moreprocessors housed by the housing; a video camera housed by the housing;a microphone housed by the housing; a speaker housed by the housing; andan accelerometer housed by the housing, wherein: the video and audiocapture device does not have a video display; in response to detectingan acceleration of the housing over a predefined acceleration threshold,an acceleration-triggered asymmetrical video communication request iscaused to be transmitted to a cloud-based communication server system;and the cloud-based communication server system is configured to:receive, from a device, a request to log into a user account of aplurality of user accounts, wherein the user account is linked with thevideo and audio capture device; receive, from the device while loggedinto the user account, an indication of a default contact and asecondary contact, wherein: the secondary contact is linked to anacceleration pattern; and the default contact is linked to otherinstances of acceleration over the predefined acceleration threshold;receive the acceleration-triggered asymmetrical video communicationrequest from the video and audio capture device; determine the useraccount from the plurality of user accounts that is linked with thevideo and audio capture device from which the acceleration-triggeredasymmetrical video communication request was received; select thedefault contact from the default contact and the secondary contact withwhich to establish an acceleration-triggered asymmetrical videocommunication link based on detecting the acceleration of the housingover the predefined acceleration threshold and comparing to theacceleration pattern; establish the acceleration-triggered asymmetricalvideo communication link between the application executed by the mobiledevice of the contact and the video and audio capture device based onthe acceleration pattern, wherein: video and audio captured using thevideo and audio capture device are streamed via the cloud-basedcommunication server system to the application and audio captured by themobile device is streamed via the cloud-based communication serversystem to the video and audio capture device; and video is not streamedfrom the mobile device to the cloud-based communication server system orto the video and audio capture device.
 2. The asymmetrical videocommunication system of claim 1, wherein the cloud-based communicationserver system is further configured to: transmit a request to theapplication executed by the mobile device based on the application beinglinked with the user account in response to receiving theacceleration-triggered asymmetrical video communication request.
 3. Theasymmetrical video communication system of claim 2, wherein the mobiledevice is configured to: present a notification indicative of therequest in response to receiving the request for theacceleration-triggered asymmetrical video communication link; and inresponse to user input provided in response to the presentednotification, the application executed by the mobile device isconfigured to cause an activation request to be transmitted to thecloud-based communication server system.
 4. The asymmetrical videocommunication system of claim 3, wherein the cloud-based communicationserver system is configured to establish the acceleration-triggeredasymmetrical video communication link between the application executedby the mobile device and the video and audio capture device based onreceipt of the activation request.
 5. The asymmetrical videocommunication system of claim 3, wherein the cloud-based communicationserver system is further configured to transmit a picture captured bythe video and audio capture device as part of the notification, whereinthe picture was captured based on the video and audio capture devicedetecting the acceleration over the predefined acceleration threshold.6. The asymmetrical video communication system of claim 3, wherein thecloud-based communication server system is further configured totransmit a video clip captured by the video and audio capture device aspart of the notification, wherein: the video clip was captured based onthe video and audio capture device detecting the acceleration over thepredefined acceleration threshold; and the video clip temporally spansfrom before the video and audio capture device detected the accelerationover the predefined acceleration threshold until after the video andaudio capture device detected the acceleration over the predefinedacceleration threshold.
 7. The asymmetrical video communication systemof claim 1, wherein the acceleration pattern over the predefinedacceleration threshold is indicative of a user performing a sequence oftaps interspersed with at least one pause on the housing of the videoand audio capture device.
 8. The asymmetrical video communication systemof claim 1, wherein the acceleration-triggered asymmetrical videocommunication request to be transmitted to the cloud-based communicationserver system is transmitted by the video and audio capture device inresponse to detecting multiple distinct instances of accelerationgreater than the predefined acceleration threshold within a predefinedperiod of time.
 9. The asymmetrical video communication system of claim8, wherein the video and audio capture device is further configured to:transmit a pattern indication to the cloud-based communication serversystem, wherein the pattern indication indicates a number of distinctinstances of acceleration greater than the predefined accelerationthreshold within the predefined period of time.
 10. The asymmetricalvideo communication system of claim 9, wherein the cloud-basedcommunication server system is further configured to determine the useraccount from the plurality of user accounts at least partially based onthe pattern indication.
 11. The asymmetrical video communication systemof claim 1, wherein the video and audio capture device further comprisesa light and the video and audio capture device is further configured tochange an illumination state of the light in response to theacceleration-triggered asymmetrical video communication request beingtransmitted.
 12. A method for initiating and conducting asymmetricalvideo communication, the method comprising: receiving, by a cloud-basedcommunication server system, a request to log into a user account thatis linked with a video and audio capture device; receiving an indicationof a default contact and a secondary contact, wherein: the secondarycontact is linked to an acceleration pattern; and the default contact islinked to other instances of acceleration over a predefined accelerationthreshold; detecting, by the video and audio capture device, anacceleration of a housing of the video and audio capture device over apredefined acceleration threshold; transmitting, by the video and audiocapture device, an acceleration-triggered asymmetrical videocommunication request to the cloud-based communication server system inresponse to detecting the acceleration of the video and audio capturedevice over the predefined acceleration threshold, wherein the video andaudio capture device does not have a video display; receiving, by thecloud-based communication server system, the acceleration-triggeredasymmetrical video communication request from the video and audiocapture device; determining, by the cloud-based communication serversystem, the user account that is linked with the video and audio capturedevice from which the acceleration-triggered asymmetrical videocommunication request was received; selecting, by the cloud-basedcommunication server system, a contact from the default contact and thesecondary contact with which to establish an acceleration-triggeredasymmetrical video communication link based on detecting theacceleration of the housing over the predefined acceleration thresholdand comparing to the acceleration pattern; establishing, by thecloud-based communication server system, the acceleration-triggeredasymmetrical video communication link between the video and audiocapture device and an application executed by a mobile device linkedwithin the contact, wherein the mobile device comprises a display,microphone, and speaker; and streaming video and audio captured usingthe video and audio capture device via the cloud-based communicationserver system to the application and audio captured by the mobile deviceis streamed via the cloud-based communication server system to the videoand audio capture device.
 13. The method for initiating and conductingasymmetrical video communication of claim 12, further comprising:transmitting, by the cloud-based communication server system, a requestto the application executed by the mobile device based on theapplication being linked with the user account in response to receivingthe acceleration-triggered asymmetrical video communication request. 14.The method for initiating and conducting asymmetrical videocommunication of claim 13, further comprising: presenting, by theapplication of the mobile device, a notification indicative of therequest in response to receiving the request for theacceleration-triggered asymmetrical video communication link; andcausing, by the application an activation request to be transmitted tothe cloud-based communication server system in response to user inputprovided in response to the presented notification.
 15. The method forinitiating and conducting asymmetrical video communication of claim 14,further comprising: establishing, by the cloud-based communicationserver system, the acceleration-triggered asymmetrical videocommunication link between the application executed by the mobile deviceand the video and audio capture device based on receipt of theactivation request.
 16. The method for initiating and conductingasymmetrical video communication of claim 14, further comprising:transmitting, by the cloud-based communication server system, a picturecaptured by the video and audio capture device as part of thenotification, wherein the picture was captured based on the video andaudio capture device detecting the acceleration over the predefinedacceleration threshold.
 17. The method for initiating and conductingasymmetrical video communication of claim 14, further comprising:transmitting, by the cloud-based communication server system, as part ofthe notification, a video clip captured by the video and audio capturedevice, wherein: the video clip was captured based on the video andaudio capture device detecting the acceleration over the predefinedacceleration threshold; and the video clip temporally spans from beforethe video and audio capture device detected the acceleration over thepredefined acceleration threshold until after the video and audiocapture device detected the acceleration over the predefinedacceleration threshold.
 18. The method for initiating and conductingasymmetrical video communication of claim 12, wherein theacceleration-triggered asymmetrical video communication request istransmitted to the cloud-based communication server system by the videoand audio capture device in response to detecting multiple distinctinstances of acceleration greater than the predefined accelerationthreshold within a predefined period of time.
 19. The method forinitiating and conducting asymmetrical video communication of claim 18,further comprising: transmitting, by the video and audio capture device,a pattern indication to the cloud-based communication server system,wherein the pattern indication indicates a number of distinct instancesof acceleration greater than the predefined acceleration thresholdwithin the predefined period of time; and selecting, by the cloud-basedcommunication server system, the user account from a plurality of useraccounts at least partially based on the pattern indication.
 20. Themethod for initiating and conducting asymmetrical video communication ofclaim 12, further comprising: altering, by the video and audio device,an illumination state of a light in response to theacceleration-triggered asymmetrical video communication request beingtransmitted.